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The clustering of matter on cosmological scales is an essential probe for studying the physical ori- 
gin and composition of our Universe. To date, most of the direct studies have focused on shear-shear 
weak lensing correlations, but it is also possible to extract the dark matter clustering by combining 
galaxy-clustering and galaxy-galaxy-lensing measurements. In order to extract the required informa- 
tion, one must relate the observable galaxy distribution to the underlying dark matter distribution. 
In this study we develop in detail a method that can constrain the dark matter correlation function 
from galaxy clustering and galaxy-galaxy-lensing measurements, by focusing on the correlation co- 
efficient between the galaxy and matter overdensity fields. Our goal is to develop an estimator that 
maximally correlates the two. To generate a mock galaxy catalogue for testing purposes, we use 
the Halo Occupation Distribution approach applied to a large ensemble of iV-body simulations to 
model pre-existing SDSS Luminous Red Galaxy sample observations. Using this mock catalogue, we 
show that a direct comparison between the excess surface mass density measured by lensing and its 
corresponding galaxy clustering quantity is not optimal. We develop a new statistic that suppresses 
the small-scale contributions to these observations and show that this new statistic leads to a cross- 
correlation coefficient that is within a few percent of unity down to 5 /i~ Mpc. Furthermore, the 
residual incoherence between the galaxy and matter fields can be explained using a theoretical model 
for scale-dependent galaxy bias, giving us a final estimator that is unbiased to within 1%, so that we 
can reconstruct the dark matter clustering power spectrum at this accuracy up to k ~ 1 /iMpc -1 . 
We also perform a comprehensive study of other physical effects that can affect the analysis, such 
as redshift space distortions and differences in radial windows between galaxy clustering and weak 
lensing observations. We apply the method to a range of cosmological models and explicitly show 
the viability of our new statistic to distinguish between cosmological models. 
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I. INTRODUCTION 

The current paradigm for the history of our Universe, 
also known as the ACDM cosmology, comes along with 
dark ingredients that have not yet been directly detected: 
Cold Dark Matter (hereafter CDM) and Dark Energy Q. 
CDM particles constitute about 20% of the total energy 
budget of the Universe, and whilst there is no confirmed 
direct detection of them in a laboratory experiment, the 
indirect astrophysical evidence supporting their existence 
is substantial. However, even more puzzling is the exis- 
tence and true physical nature of Dark Energy, which 
contributes roughly 75% of the total energy budget of 
the Universe and is responsible for driving the late-time 
accelerated expansion of spacetime. 

The dark matter power spectrum and its real space 
equivalent, the correlation function, contain a wealth of 
cosmological information, e. g. on neutrino mass, dark 
energy equation of state and the initial conditions of the 
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Universe. Thus it is a key goal of cosmology to infer 
these quantities from observables. However, to achieve 
this requires a solid understanding of the galaxy bias - 
the relation between the observable galaxies and the un- 
derlying dark matter density field. This understanding 
is especially important for the interpretation of ongo- 
ing and upcoming surveys, such as SDSSjlJ, DES[7c|. 
PanSTARRS[z3 and EUCLID 0. 

The reconstruction of the CDM distribution is usually 
based on the assumption, that galaxies trace the matter 
density field, i. c. that on large scales the galaxy density 
field equals the matter density field times a parameter 
known as the bias. The resulting galaxy correlation func- 
tion can then be expressed as 

£ gg (r) = fo 2 6n m (r), (1) 

and similarly for the power spectrum in fc-space. The 
subtlety in the standard approach is that the bias has to 
be determined empirically, leading to uncertainties in the 
amplitude of the matter correlation, which finally compli- 
cates studies of the rate of change of matter fluctuations 
with time (the growth factor). Furthermore there is ev- 
idence for a non-trivial scale dependence of galaxy bias 
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041] ■ Hence it is of great importance to devise meth- 
ods that allow a direct reconstruction of the dark matter 
correlation function from observables. One of the most 
promising observational probes of dark matter on cosmo- 
logical scales is the gravitational lensing. 

We will focus our attention on a specific weak lensing 
technique, halo-galaxy lensing, which involves measure- 
ment of the shape distortions around foreground dark 
matter haloes in which galaxies form. Often the fore- 
ground object (lens) will be an individual galaxy, in 
which case this technique is called galaxy-galaxy lens- 
ing, but it can also be applied to groups and clusters. 
Since the first attempts to detect galaxy-galaxy lensing 
by @ , the quality of the data has been improved vastly by 
deeper and wider surveys. Halo-galaxy lensing has now 
been measured with relatively high signal-to-noise and 
as a function of a wide variety of properties of the lens 
galaxies, groups and clusters |7Hl0t . It has become clear 
in these studies that galaxy-galaxy lensing contains much 
information about the mass distribution around galaxies, 
and has the potential to measure dark matter halo radii, 
shapes, concentrations and masses [ITl - flij as well as the 
distribution of matter within the Universe (l5l - fl9| . 

The interpretation of the signal in terms of the link 
between galaxies and dark matter is, however, compli- 
cated by the fact that (except for galaxy clusters) galaxy- 
galaxy lensing is only detectable by stacking the signal 
from many lenses. Theoretical modelling of the galaxy- 
galaxy lensing has been done both with numerical sim- 
ulations [H |2l| and with the halo model [H, The 
combination of lensing and clustering seems to hold the 
potential to put constraints on cosmological parameters 

mM- 

In this paper our main objective is to develop a method 
that recovers a statistic closely related to the matter 
correlation function from a joint analysis of lensing and 
clustering observations. This method is presented to- 
gether with a theoretical motivation and tests on simu- 
lated galaxy samples. 

The starting point for these simulated galaxy samples 
are cosmological iV-body simulations, which are a stan- 
dard tool to investigate the non-linear evolution of the 
CDM density field. Despite their statistical power for 
describing the large scale structure of the Universe, pure 
dark matter simulations have the disadvantage that one 
must supplement them with a prescription for galaxy for- 
mation in order to reproduce the surveyed galaxy distri- 
butions. We work in the standard paradigm of hierarchi- 
cal galaxy formation: galaxies only form in dark matter 
haloes |26|. Hence, the problem is reduced to that of 
relating galaxies to dark matter haloes, and we do this 
using the Halo Model approach and in particular the Halo 
Occupation Distribution (for a review see (27j). Here we 
are focused on obtaining mock galaxy catalogues for the 
Luminous Red Galaxies (LRGs) , a subset of galaxies ob- 
served with the Sloan Digital Sky Survey (SPSS). Our 
modelling builds on earlier approaches by [5^, [29| . 

The paper breaks down as follows: in EjXT] we review 



the basics of weak gravitational lensing, an important 
probe of the dark matter on cosmological scales. Then 
in in i)IIII we introduce our main analysis tool, the cross- 
correlation coefficient. Theoretical modelling of the latter 
is carried out in ^IVI In SjV]we describe the simulations 
and the mock galaxy catalogues that we use to test our 
new method. The results of the numerical studies on the 
cross-correlation coefficient and the reconstructed mat- 
ter statistic are discussed in WI1 The effect of rcdshift 
space distortions and radial window functions on the ob- 
servational implementation of our method are explored 
in Will 3 Villi is devoted to the cosmology-dependence 
of our results. Finally, in ^IXI we will summarise and 
discuss our findings. 



II. OBSERVABLES 

A. Halo-Galaxy Lensing 

Weak gravitational lensing is one of the main probes 
for the dark matter distribution in the Universe (see (30l — 
l32| for reviews). In this study we focus on a specific weak 
lensing technique known as halo-galaxy or galaxy-galaxy 
lensing. In this technique, one infers the tangential shear 
7t around foreground objects from the deformation of 
background galaxy images. Since the shear is weak, one 
must average over a large number of background galaxies 
to obtain good signal to noise. The estimated jt can 
then be related to the projected mass distribution around 
the foreground lens galaxies. The key quantity is the 
differential excess surface mass density [32|, [33| , 

AE gm (i?) = E gm (i?) - E gm (i?) = E crit ( 7t (i?, tp)) v , (2) 

where E gm is the projected surface mass density, R « 
9D\ is the comoving transverse distance between lens and 
source galaxies with angular separation 6, and subscripts 
g and m refer to galaxies and mass, respectively [80(. 
In the above equation we also introduced the comoving 
angular diameter distance to the lens galaxy A and the 
mean surface mass density within a circular aperture, 
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The critical surface mass density 
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is a geometrical factor with A, A) As being the angu- 
lar diameter distances to the source, the lens and be- 
tween lens and source, respectively. Galaxy-galaxy lens- 
ing stacks the signal of large numbers of foreground and 
background galaxies and thus E cr it has to be understood 
as an effective quantity for the lens and source distribu- 
tion. It is sensitive to the cosmological model, including 
the matter density parameter f2 m . 
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Since the deflections are measured around foreground 
galaxies, the mass profile is directly related to the galaxy- 
matter cross-correlation function 
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mix) 



(5) 

with integration along the line of sight \- The criti- 
cal density is defined as p C rit(a) = 37J 2 (a)/87rG, where 
H(a) = a/a is the Hubble parameter. Here we include 
the radial window function gi(x) (see e.g. [22[) that de- 
scribes the dependence of lensing strength on the dis- 
tribution of the lens mass and depends on the lens and 
source positions. Note that the additional constant 1 
in the integrand drops out on computing A£ gm (i?) with 
Eq. ©. 

In principle the excess surface mass density A£(i?) 
could be integrated to yield the projected galaxy-matter 
correlation function w(R), which in turn can be depro- 
jectcd to using an Abel formula. Lensing obser- 
vations are, however, subject to noise, that is ampli- 
fied when reconstructing the correlation function ^gn^r). 
Consequently, we try to minimise the manipulations on 
the data, and rather transform theoretical predictions ac- 
cordingly. 



B. Projected correlation functions 

In addition to the mass distribution around galaxies 
one may also observe the distribution of galaxies them- 
selves. A convenient way to quantify the clustering be- 
tween the tracer fields A and B is the projected correla- 
tion function 13411 . 



wab(R) 



qMUbWx 2 + R 2 )dx, (6) 



where g g {x) is a window function and where for instance 
we are interested in: AB = {gg, gm, mm}. The line of 
sight integration partially removes rcdshift space distor- 
tions, which are an issue in the three dimensional cor- 
relation function £(r) (see Will for a discussion of the 
residual effects). Based on the projected galaxy cluster- 
ing w gg (R), we now define two statistics that correspond 
more closely to the lensing observable jt- 



AS gg (i?) = Pcrit [w gg {R) - w ee (R)] 



(7) 



A£ mm (i?) = O^pcrit [w mm (R) - w mm (R)] , (8) 

In these equations, we have multiplied by the critical den- 
sity in order to achieve the same dimensions as AE gm . 
The prcfactor £7, 2 n in the definition of AE mm accounts 
for the fact that it is a two-point statistic of matter den- 
sity. Lensing is sensitive to the total density of matter 
p m (which is proportional to r2 m ), while for galaxy clus- 
tering we usually remove the dependence on the mean 
density of galaxies and work only with the density con- 
trasts S s = (p g - pg")/pg". 



So far we have not specified the window functions for 
the line of sight integrations in Eqs. §5§ and gi(x) 
and g g (x)- I n galaxy-galaxy lensing the inhomogeneous 
mass distribution between the observer and the source 
contributes to the final distortion. Consequently the win- 
dow for lensing is typically very broad and is fixed by the 
geometrical setup of the source-lens-observer system, i. c. 
the radial distribution of the lens and source samples. 
For galaxy clustering studies, when provided with accu- 
rate redshifts, the window function can be constructed 
straightforwardly and we shall assume a narrow top-hat 
around the lens positions. We take the thickness of the 
top-hat to be A% ss 100 h~ 1 Mpc, which is a compro- 
mise between adding uncorrelated noise and increasing 
the signal. 

To simplify our investigations further we measure 
{A£g g (i?), AS gm (i?), AS mm (i?)} from our simulations 
with top-hat window functions of the same length. The 
estimates are obtained in real space and we quantify the 
effects of window functions, integration lengths and red- 
shift space distortions on the result separately in Will 
This approach enables us to disentangle the intrinsic 
properties of the mass and tracer fields and the system- 
atic effects induced by the measurement technique. 

Note that since our main goal is to develop an algo- 
rithm for reconstructing the mass clustering, we have also 
assumed that the correlation function is estimated over a 
region of space where the galaxy selection function does 
not vary significantly, hence one must be careful when 
applying it to the galaxies close to the edge of the sur- 
vey. 



III. CROSS-CORRELATION COEFFICIENT 

The cross-correlation coefficient between two density 
fields A and B may be defined using the correlation func- 
tion £ as [HI 
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and is a measure of the statistical coherence of the two 
fields [35l - l39j . If r cc = 1 then the fields are fully cor- 
related and there exists a deterministic mapping be- 
tween the fields. This behaviour would be expected 
for any scale-dependent, deterministic, linear bias model 
of haloes or galaxies: £ gin (r) = 6(r)£ mm (r), £ gg (r) = 
b 2 (r)£ mm (r). On the other hand, if r cc 7^ 1 then the 
fields are incoherent, and for the local model of galaxy 
formation, this may arise due to stochasticity and non- 
linearity in the bias relation j37| . The r cc constructed 
from real-space statistics can be > 1 (unlike in Fourier 
space), since £ gg has the shot noise subtracted off; this 
behaviour will be seen in several places in this work. 



Studying the cross-correlation coefficient 
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of the 



haloes in the numerical simulations used for this work 
we find that the cross-correlation coefficient of haloes is 
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close to unity on large scales and decreases below unity 
on scales below 10 /i _1 Mpc similarly for a large range of 
halo masses 1.3 x 10 13 ft^M© < M < 3 x 10 15 h~ l M & 

As was already mentioned, it is a key goal of cosmol- 
ogy to recover the dark matter correlation function from 
the observations. In this context it is important to quan- 
tify how well galaxies trace the underlying dark matter 
density field, which inspires us to examine the cross- 
correlation coefficient between the matter and galaxy 
fields. One approach is to measure the excess surface 
mass density from galaxy-galaxy lensing using Eq. (|5|). 
In this case, we define cross-correlation coefficient by re- 
placing the correlation functions in Eq. ((9]) with the cor- 
responding excess surface mass densities, 
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(10) 



Due to our definition of AE mm and A£ gg the prefactors 
H m cancel and the resulting statistic is only dependent 
on the ratio of the correlation functions. Thus rcf^m(-R) 
is expected to approach unity on linear scales. 

The excess surface mass density AE(i?) measures the 
difference between the surface mass density averaged over 
an aperture of radius R and the actual value at the 
boundary of the aperture. Consequently it combines in- 
formation from small scales, which are highly non-linear 
and stochastic, and larger, linear scales, where stochas- 
ticity is believed to be small. To remove part of the 
incoherence introduced by the non-linear clustering pro- 
cess we introduce a new statistic T(i?), that we call the 
Annular Differential Surface Density (hereafter ADSD). 
This statistic eliminates the contributions to A£(i?) from 
small scales as follows: 

T(R;R ) ee A£(i?) - |f A£(i? ) ; (11) 
= A J\r'R'E(R') 

-jp[R 2 X(R) - RtZiRo)]. (12) 

Setting the cutoff radius to Ro — the new statistic 
T reduces to AS. Note that T is completely indepen- 
dent of the correlation function on scales below R . Our 
motivation in subtracting out small-scale contributions 
was to recover a statistic that does not mix small and 
large scales. Thus we suggest the choice Ro ~ 2r vir , 
where r v - n - is the average virial radius of the host haloes of 
the galaxy sample under consideration. On scales below 
two virial radii, the intra-halo non-linear clustering dom- 
inates, whereas the weakly non-linear scales exceeding 
2r v j r can be modelled by simulations and perturbation 
theory. We suggest a conservative choice of Rq to avoid 
problems in the transition region between small and large 
scales even if the signal-to-noise ratio is slightly degraded. 
The virial radii have to be inferred from a mass estima- 
tor such as X-ray or gravitational lensing. The latter has 
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FIG. 1: Top panel: Excess surface mass density AE(i?) (solid) 
and ADSD T(R;R ) (dashed) with R = 3 /i _1 Mpc for 
our fiducial cosmological model and the luminosity-threshold 
LRG sample. We show the statistics for the galaxy auto- 
correlation (top red), the galaxy-matter cross-correlation 
(central green) and the matter auto-correlation (bottom 
blue). The upturn of the cross-correlation towards small 
scales leads to a cross-correlation coefficient in excess of unity 
as we will see later. Bottom panel: Cross-correlation coeffi- 
cient of the clustering statistics shown in the top panel. The 
bare excess surface mass density (solid) leads to strong devi- 
ations from unity, whereas the ADSD with _Ro = 3 /i -1 Mpc 
(dashed) recovers a cross-correlation close to unity. 



the advantage that the same observation can be used to 
infer the mass and the ADSD statistic. In a companion 
paper [42j we show that the ADSD, with a cutoff radius 
Ro ~ 0.25r v j r , can also be used to avoid statistical and 
systematical uncertainties about the inner parts of the 
halo profiles and thus is a viable tool to calculate cluster 
masses using an iterative procedure. 

One may calculate the cross-correlation coefficient of 
the ADSD 



(R) 



T gm (i?) 



\/Ygg(.R)T mm (.R) 



(13) 



In Fig. [TJ we plot both the excess surface mass den- 
sity and the ADSD T defined from the galaxy auto- 
correlation, matter auto-correlation and their cross- 
correlation. As galaxies we choose a model for Luminous 
Red Galaxies (LRGs), as discussed in more detail in [jV] 
We observe that the galaxy-galaxy and galaxy-matter ex- 
cess surface mass densities are not multiples of the matter 
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correlation function on small scales, so that we expect a 
cross-correlation different from unity for the bare statis- 
tic. This result is seen in the bottom panel of Fig. [TJ 
where the deviations from unity extend to scales above 
10 /i _1 Mpe. Subtracting the signal at Rq = 3 /i _1 Mpc 
as in Eq. [12] to get T, we remove these non-linearities 
and recover similar shapes for all three functions. As a 
result, the cross-correlation coefficient is now much closer 
to unity on all scales above Rq, as seen in the bottom of 
Fig.ffl 

Both the projected correlation function and the ADSD 
are defined by integrals of the correlation function 
weighted by a kernel. The projected correlation function 
can be written as 



w(R)= £(x)W w (x;R)xdlnx 
Jo 

where the window function is written, 
2x 



(14) 



Here Q(x) is the Heaviside step function. Thus w(R) has 
contributions only from scales x > R, and the window 
function is peaked at x = R. 



The ADSD defined in Eq. ([12]) involves a radial av- 
erage and subtraction of w(R). Both operations can be 
included in the integration kernel, and we may write the 
ADSD as 
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£,{x)W T {x; R, R )x d In x . (16) 
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®{x — R)Q(\/Xmax + R 2 — x) . (15) where the window function for T(R,Rq) is written, 
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(17) 



The scale dependence of the integration kernels reveals 
the scales in the correlation function that are dominating. 

In Fig. [5] we show the window function Wr(x;R,Ro) 
for three different radii and Rq = 3 /i _1 Mpc. Since 
approximately follows a decreasing power-law, the 
leading contribution is at the scale R, where the sign 
changes. This sign-change is due to the subtraction 
ASab = p\wab(R) — wab(R)} and is the same as for 
AS. For AS the window is exactly compensated, mean- 
ing that it integrates to zero, hence AE is insensitive to 
adding a mean density component, the so called mass 
sheet degeneracy. This compensation is fortunate, since 
it means that this statistic is insensitive to the long wave- 
length modes that can move w(R) up and down, i. c. the 
long wavelength sampling variance affects w(R) on all 
scales. The compensated window also makes AE less 
sensitive to the redshift space distortions, as discussed in 
Will The statistic T, though not exactly compensated, 
retains most of these beneficial properties, while at the 
same time eliminating small scale clustering information. 

The scales probed by the statistic T are however more 
obvious in the power spectrum. The conversion from 
P(k) — > T(R; Rq) can be written as 



T(R;R ) 



P(k)kW?(k; R, R ) din A;, 



(18) 



where the window function is no longer given by a simple 
analytical form due to the spherical Bessel functions oc- 



curring in the Fourier transform. In Figure [5] we show 
this window function multiplied with the power spec- 
trum. From this plot we see that for a cutoff radius 
Rq = 3 /i _1 Mpc, T essentially probes scales down to 
k rj 1 h Mpc™ 1 . The integrand in ([T5]) is peaked at the 
scale k ps tt/R, and is strongly oscillatory on small scales. 

For later use we plot the window function for the corre- 
lation function W^(k;r) = k 2 sin kr/kr that relates P{k) 
and f(r) via £(r) = V/(2ir) 3 J kP(k)W ( (k; r) dlnfc. 



IV. THEORETICAL MODELLING OF SCALE 
DEPENDENT BIAS 

In this section we will use cosmological perturba- 
tion theory (for a review see [ilj) to predict the cross- 
correlation coefficient. Our discussion is based on a Tay- 
lor expansion of the galaxy density field in terms of the 
matter overdensity S 
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Pe = po + p' S + -p'>5 2 + -p' "6 3 + e + 0(6+) (19) 

Such an expansion is only valid on scales exceeding the 
virial radius of dark matter haloes, since it contains no 
mechanism of halo exclusion or the radial distribution 
of galaxies within their host halo. Absorbing potentially 
di verg ent terms into the bias and shot noise parameters, 
[H, |44| showed that the auto- and cross-power spectrum 
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FIG. 2: Left panel: Window functions for T(R;Ro) with Ro = 3 h 1 Mpc in real space. We show the window function for 
R = 5,10,20 /i _1 Mpc as green solid, orange dashed and red dash-dotted lines respectively. Right panel: Window functions 
for T(R;R ) with R = 3 /i _1 Mpc in k -space. We show the products of window function and power spectrum for R = 
5, 10,20 /i _1 Mpc as green solid, orange dashed and red dash-dotted line respectively. For reference we also plot the window for 
the correlation function as thin lines. Note that we multiplied with x and k respectively to account for the logarithmic scale 
on the ordinate axis. 



of a biased tracer field can be written up to fourth order 
in the matter density field as 



Pgm(fc) = b 1 P mJ (k) + b 2 A(k), 
P gg (k) = b 2 1 P NL (k) + 2b 1 b 2 A{k) 



bl 



(20) 

B(k) + N(2l) 



where N is the renormalized shot noise, b\ and b 2 are the 
rcnormalized bias parameters and Pnl is the non-linear 
power spectrum. The calculation of the latter can be 
carried out with any perturbative technique, e. g. stan- 
dard perturbation theory (43j , renormalized perturbation 
theory (45l . |46| or Lagrangian perturbation theory. The 
advantage of this renormalisation of the bias parameters 
is that there is no artificial smoothing scale involved in 
the above expansion. However this comes at a price: the 
bias and shot noise are no longer given ab-initio from 
theory, but have to be determined empirically. 

The correction terms A(k) and B{k) introduced in the 
above equation arc defined as 



A(k) = 

B(k) = 
where 
F 2 {ki,k 2 ) 



d 3 q 
(2tt) 3 

d 3 g 
(2tt) 3 



Piin(q)Piin(\k-q\)F 2 (q,k-q),(22) 

flin(|9|)[flin(|fc-9|)-flin(?)] ,(23) 
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is the second order standard mode coupling kernel. In 
the above integrals, one can use the linear power spec- 
trum, because the integrals are already fourth order in 
the matter density field S. 

Due to the linearity of the expressions in Eqs. (f2T)]) and 
(f2"TT) the transformation to real space is straightforward 

£gmM =b 1 ^ L (r)+b 2 A(r) , (25) 
4g(r) =&?£nlM + 2b 1 b 2 A(r) + ^B(r) (26) 

with ( mm (r) = £(r), and A(r), B(r) being the Fourier 
transforms of A(k), B(k) respectively. It is easy to show 
that B(r) = £ 2 — <r 2 S D (r), where a is the variance of the 
power spectrum. Figure [3] shows the terms contributing 
to the galaxy auto- and cross-correlation functions as well 
as a fit to the correlation functions measured in our nu- 
merical simulations. The A(r) term is positive on small 
scales and changes sign at r rj 6 /i _1 Mpc. The B(r) term 
affecting the auto-correlation dominates over the matter 
correlation function on scales below r < 4 /i _1 Mpc. 

Let us for later convenience define the parameter com- 
bination 



V 



(27) 



(24) 



As shown by [41J in the regime where A{r) <C £( r ) and 
B(r) <§; £(r) the cross-correlation coefficient can be writ- 
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FIG. 3: Left panel: Non-linear corrections to the real space correlation function as function of radial separation calculated for 
redshift z = 0.23. We show the linear (black dotted) and non-linear matter correlation function (thick blue solid) as well as 
the B (red dash-dotted) and A (green solid and dashed) correction terms. The dashed portion of the graph of A(r) denotes 
the range where it is negative. Right panel: Perturbation theory fit (dashed) over scales 6 /i _1 Mpc < r < 80 /i -1 Mpc to the 
measured galaxy-correlation functions (solid) of the luminosity-threshold sample. The fits to £gg (upper green), £ gm (central 
blue) and a joint fit provide consistent results. We are not expecting a good agreement on scales below r ~ 3 ft -1 Mpc, where 
the correlation function is dominated by non-linear clustering. 
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FIG. 4: Left panel: Perturbation terms for the projected surface mass density w. We show the non-linear matter correlation 
function (black dotted) as well as the wb (red dash-dotted) and wa (green dashed and solid) terms. Note that the dashed 
part of the latter curve is negative. Right panel: Perturbation terms for the ADSD and Ro = 3 /i _1 Mpc. Again we show the 
non-linear matter statistic (black dotted) together with the Ta (green dashed and solid) and Tb (red dash-dotted) terms. Note 
that the scale at which the Tb term becomes comparable to the non-linear correlation is shifted further out to R tts 9 /i -1 Mpc. 



8 



ten as: 



\A 



y/ffi + 2aA + a 2 B/2) ' 

1 ,B 9 A 2 1 oAB 
il - -a 2 — - a 2 — - -a 6 — 

£2 4 £2 



-A - -a 



1 2^0 

C(r) 



<1 - -a^(r) 



(28) 

(29) 
(30) 
(31) 



Obviously the model predicts a scale dependent cross- 
correlation that is below unity on small scales and asymp- 
totically approaches unity for increasing r. As shown 
in [Jlj], the shape of the cross-correlation coefficient of 
haloes measured in the simulations is well described by 
the functional form of the above equation, and the prefac- 
tor a is a weak function of halo mass. Clearly this simple 
theoretical model is not able to cover the non-linear be- 
haviour inside the virial radius after shell-cros sing . Qual- 
itatively similar predictions were presented by |47| for the 
peak model of [48[ . 

We will now proceed to develop the results repeated 
here for the readers' convenience for use in our investi- 
gations. A result similar to Eq. (|3"Tj) remains valid if we 
consider the projected correlation function, since the in- 
tegration along the line of sight is a linear operation. We 
therefore have: 



w(R) = 
w A (R) = 
w B (R) = 



"Xmax 
+ Xma> 



£(x/R 2 + x 2 )d x ; 



A(y/R? + X 2 )dx 



B(^/R 2 + X 2 )dx 



(32) 
(33) 
(34) 



Furthermore, the manipulations that lead to the excess 
surface mass density, or more generally to the ADSD 
f (r) -> AE(ii) -> T(i?) are linear in the fields A, B and 
£. Consequently the corresponding terms have the same 
order as their underlying statistic and we can write, 



1 



(R) = l- -a 



T B (R) 

"T mm(-R) 



(35) 



Note that to evaluate the term Yb(-R) we only need to 
replace £(r) with £ 2 (r) in Eq. (fT6|) . The effective value 
of a = (b 2 ) I (bi) for our galaxy catalogues can be esti- 
mated using the mean bias parameters from the peak- 
background-split [49l l50l] . 



J n(M) (N(M))bi(M)dM 
Jn(M) (7V(M))dM 



i = 1,2, 



(36) 



where n(M) is the halo mass-function and (N(M)) is the 
halo occupation number. For the rest of this work we will 
adopt a = 0.26, which is close to peak-background split 



predictions [4l|. An alternative approach, accounting 
for the renormalised nature of the parameters, would be 
to fit for the model parameters by matching theoretical 
and measured correlation functions as shown in the right 
panel of Figure [3l This second approach provides results 
that are consistent with the pcak-background-split result. 

In Figure 0] we plot the correction terms contributing 
to the projected correlation function and the ADSD. The 
prerequisite for the Taylor expansion to be applicable is 
that the correction terms wx/wmm <C 1 and WB/w mm <C 
1 or TA/T mm <C 1 and TB/T mm <C 1. We see that these 
assumptions are violated below R = 3 /i _1 Mpc for the 
projected correlation function and below R — 9 /i _1 Mpc 
for the ADSD. For the non- linear correlation function £ 
we use the matter correlation function measured in the 
simulations. 



V. NUMERICAL MODELLING 

With modern large supercomputers and well developed 
algorithms it is now possible to model the evolution of 
the dark matter density field on cosmological scales rea- 
sonably well. However, what one observes are not dark 
matter haloes but galaxies. Supplementing the distribu- 
tion of dark matter in a simulation box with the galaxy 
distribution corresponding to a particular galaxy sample 
would in principle require an understanding of the pro- 
cess of galaxy formation. An ab initio treatment of all 
the baryonic processes is difficult, and requires, e. g. full 
treatment of the hydrodynamics, atomic and radiative 
heating and cooling of gas at high resolution. Owing to 
the large computational cost, state of the art simulations 
are restricted to relatively small scales and lack sufficient 
volume to extract statistically relevant information on 
cosmological scales [HT| ■ Thus we pursue a statistical 
approach to populate the haloes identified in a suite of 
large-scale A^-body simulations with galaxies. 



A. The Simulations 

Our numerical results are based on the Zurich hori- 
zon "zHORIZDN" simulations, a suite of 30 pure dissi- 
pationlcss dark matter simulations of the ACDM cos- 
mology in which the matter density field is sampled by 
N p = 750 3 dark matter particles. The box length of 
1500 /i~ 1 Mpc, together with the cosmological parame- 
ters given in Table U then implies a particle mass of 
M<im = 5.55 x 10 11 h~ 1 M (3 . This simulation volume 
enables high precision studies of the fluctuations in the 
ACDM model on scales up to a few hundred comoving 
megaparsecs [52| . 

The simulations were carried out on the ZB0X2 and 
ZBQX3 computer-clusters of the Institute for Theoreti- 
cal Physics at the University of Zurich using the pub- 
licly available GADGET- 1 1 code The force softening 
length of the simulations used for this work was set to 
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60 /i _1 kpc, consequently limiting our considerations to 
larger scales. The transfer function at redshift z = 
was calculated using the CMBFAST code of and then 
rescaled to the initial redshift z, = 50 using the linear 
growth factor. For each simulation, a realisation of the 
power spectrum and the corresponding gravitational po- 
tential were calculated. Particles were then placed on 
a Cartesian grid of spacing Ax = 2 ft, _1 Mpc and dis- 
placed according to a second order Lagrangian pertur- 
bation theory. The displacements and initial conditions 
were computed with the 2LPT code of [H, [56| . 

The cosmological parameters for the simulations were 
inspired by the best fit values released by the WMAP3 
analysis of the cosmic microwave background [13, l58j . 
and can be taken from Table [U Throughout the paper 
we adopt this parameter set as our fiducial cosmological 
model. 

Our effective volume is V — 27 /i" 3 Gpc 3 . For each of 
the simulation outputs, gravitationally bound structures 
were identified using the B-FoF algorithm kindly provided 
by Volker Springel. The linking length in this Friends- 
of-Friends halo finder was set to 0.2 of the mean inter- 
particle spacings, and haloes with less than 20 particles 
were rejected. All together we resolve haloes with M > 
1.2 x 10 13 h- x M P) . 
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CI 


0.25 


0.75 


0.7 0.8 


0.95 


— 1 


4 


C2 


0.25 


0.75 


0.7 0.9 


1.0 


-1 


4 


C3 


0.2 


0.8 


0.7 0.8 


1.0 


-1 


4 


C4 


0.3 


0.7 


0.7 0.8 


1.0 


-1 


4 



TABLE I: Cosmological parameters adopted for our in- 
vestigations. Matter density parameter, dark energy den- 
sity parameter, dimensionless Hubble parameter Ho = 
100ft km s _1 Mpc" 1 , power spectrum normalisation, pri- 
mordial power spectrum slope, dark energy equation of state 
p = cop, number of simulation outputs. The first line is our 
fiducial model. In order to evaluate the cosmology depen- 
dence of our results we use four other cosmologies denoted as 
C1-C4. 



B. HOD Modelling I - Luminosity Threshold 
Sample 

The statistical model used to populate the haloes with 
galaxies is known as the Halo Occupation Distribution 
(HOD), which is closely related to the Halo Model of 
Large Scale Structure (for a review see [27j). The HOD 
assumes that galaxies form in the dark matter poten- 
tial wells, because only there can baryons cool with suffi- 
cient efficiency. To translate this idea into a quantitative 
model, one must fix the following ingredients: 

1. Number of galaxies that occupy a halo of mass M 

2. Radial distribution of galaxies within the halo 



Theories of galaxy formation suggest a division into cen- 
tral and satellite galaxies. Central galaxies are those that 
reside at the minimum of the potential well for host dark 
matter haloes. In contrast, satellite galaxies orbit the 
central galaxy and are presumed to have their own asso- 
ciated subhalo within the larger host halo. Furthermore 
we will assume that the number of satellite galaxies is a 
function of host halo mass only and neglect any environ- 
mental influences. The basic assumption of the model is 
that bright galaxies will not be able to live in low mass 
haloes, since there is not enough cold gas to form such 
galaxies. Equivalent ly, the halo mass can be represented 
by the virial radius r vlI , defined by the condition that the 
density within r v i r equals 200 times the critical density 

Pcrit ■ 

Let us start by considering the HOD required to model 
a luminosity-threshold sample of galaxies, and later in 
flV CI we will describe the necessary adaptations required 
for the more complex luminosity bin sample. 

First, we decide whether a halo of given mass contains 
a central galaxy at its potential minimum. It is rea- 
sonable to assume that a threshold in galaxy luminosity 
corresponds to a threshold in halo mass, but in practice 
it is necessary to take into account the scatter in the 
luminosity-mass relationship. We do this by appropri- 
ately smoothing the mass threshold. Following |29i. |59|. 
we take the mean number of central galaxies occupying 
a mass M halo to be: 



(N c , 



erfc 



In (M/M cut ) 
V2cr 



(37) 



where erfc(x) = 1 — crf(x) is the complementary error 
function, and M cut and a are parameters to be deter- 
mined from the data. This relationship is then used as a 
sampling probability for the Bernoulli distribution: op- 
erationally this amounts to drawing a random number, 
T € [0, 1], and if T < (N ccn ) then we place a central. This 
central galaxy is assumed to be formed by the baryons 
cooling in the dark matter potential well and subsequent 
collisions with satellite galaxies that approach the halo 
centre due to dynamical friction. 

Satellite galaxies that orbit the halo center mostly orig- 
inate from the merging of haloes already containing a 
central galaxy. Subhalo counts in high resolution TV-body 
dark matter simulations have shown that the number of 
satellite galaxies follows a Poisson distribution around an 
asymptotic power law (60| . Hence we take, 



M-M„ 
Mi 



, if M > M min A N ccn ^ 

otherwise , 

(38) 

which introduces another three parameters to be deter- 
mined: Mmi n , Mi and a. Finally, as a further constraint 
we impose the condition that satellite galaxies can only 
reside in haloes already containing a central galaxy. 

The satellite galaxies are expected to be situated in 
the subhaloes orbiting the halo centre. Our simulations 
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lack sufficient resolution to identify such dark matter 
substructures, so we instead sample the galaxy positions 
from the dark matter particle positions. Compared to a 
galaxy distribution following a profile this approach has 
several advantages. Firstly, we avoid the assumption of 
a functional form for the halo profile, instead profiting 
from the full triaxial dark matter distribution. Secondly, 
we can assign the dark matter particle velocities to the 
galaxies, which is useful for studies of the redshift space 
distortions. 

All together, we have five-dimensional parameter space 
spanned by {M cut , cr, M min , Mi, a}. We vary these five 
parameters in order to generate galaxy catalogues that 
can reproduce observed galaxy clustering and galaxy- 
galaxy lensing measurements for the two LRG samples 
described in [23| and WD\ below. For each point in pa- 
rameter space, we generate four galaxy catalogues per 
simulation using different random seeds. We then calcu- 
late the average of the clustering statistics of these four 
catalogues to remove some of the stochasticity intrinsic 
to the HOD model, and finally compare to the data. To 
reduce the dimension of the parameter space and thus 
the computational costs, we use the observed abundance 
of the LRG sample to impose a further constraint on the 
cutoff mass M cut by demanding 



n obs = / dMn(M) (N tot ) , 



(39) 



where n(M) is the halo mass- function |82| an d the mean 
total number of galaxies per halo is given by 



(JVtot) = (N cen ) [<J\U) + 1] , 



(40) 



where the form of (N ccn ) accounts for the scatter in the 
luminosity-mass relationship. Finally, we end up with a 
4-dimcnsional parameter space, which is sampled on a 
grid of points, for each of which we calculate M cut . 

Figure ([5]) shows the mass dependence of the cen- 
tral, satellite and total halo occupation number for our 
best fit luminosity-threshold galaxy samples. The total 
occupation number is dominated by the central galax- 
ies residing in the highly abundant low-mass haloes. 
Satellite galaxies start to dominate only for masses of 
Mw4x 10 14 h^Mn. 



C. HOD Modelling II - Luminosity Bin Sample 

To model a luminosity binned galaxy sample it is neces- 
sary to apply some minor changes to our HOD modelling. 
Firstly, the halo mass of the central galaxies will be a 
window rather than a threshold. Secondly, we must drop 
the constraint that satellite galaxies live only in haloes 
already hosting a central galaxy, because faint galaxies 
may orbit in heavier haloes (that already host a central 
above the luminosity cutoff) as satellites. For simplic- 
ity, we do not use the information about central LRGs 
from the bright sample, but rather model the two sam- 
ples independently. Again the number of central galaxies 
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FIG. 5: Mean galaxy number per halo as a function of halo 
mass for the luminosity-threshold sample and our best fit 
parameters from Table [II] We show the number of central 
galaxies (blue dash-dotted), satellite galaxies (red dashed) 
and the total number of galaxies (black solid). Note that 
the satellite number exceeds unity only for haloes with M > 
6 x 10 14 /i _1 M0, corresponding to virial radii exceeding 
r vir > 2 /i^Mpc. 



is assumed to follow a Bernoulli distribution, but with 
mean given by 



(N c , 



-erfc 
4 



In M/M, 



CLlt.l 



V2a 



rfc 



In M/M cut>2 



V2cr 



(41) 

where we have assumed that the central galaxy distribu- 
tion is symmetric in log M and that the mass-luminosity 
scatter is independent of mass. This parametrisation in- 
troduces three free parameters: [M cutj i, M cut ,2] with a 
smoothing parameter a. One may of course conceive 
more complicated window functions, however this ap- 
proach introduces the least number of additional free pa- 
rameters into the modelling procedure whilst being flex- 
ible enough to describe the data. 

For the satellite galaxy distribution, we again assume 
that the number follows a Poisson distribution, with 
mean specified by 



(N s ,t) c (M) 




M-M„ 
Ah 



if M > M min 
otherwise . 



Thus in total we must constrain six free parameters. 
However, we may reduce the dimensionality of the prob- 
lem by calculating the appropriate lower mass cutoff 
M cut ,i for each of the points in the five-dimensional 
space spanned by {M cut .2, o - , M\, Mmin, en} according to 
Eq. dSU). 
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D. Reference Sample 

In this study, we develop our analysis for ap plication to 
the SDSS spectroscopic LRG sample [H Elf. The LRGs 
are typically bright red ellipticals that are volume-limited 
within a much larger volume than the main galaxy sam- 
ple of the SDSS. Thus they are frequently used as an ef- 
ficient tracer of large scale structure. Furthermore, since 
the LRGs have been shown to live in the most massive 
haloes of the Universe 0, [M H, H , they can be effec- 
tively probed with our TV-body simulations. 

The specific LRG samples that we tunc our HODs to 
are presented in We model the galaxy-galaxy lens- 
ing from that study, along with new projected correlation 
function measurements of the same samples. In [l0| . the 
LRG samples were split into two sub-samples, LRGbright 
and LRGfaint, based on the r-band luminosities k + e- 
corrected to z = 0. LRGbright is a luminosity-threshold 
sample with a number density of n = 4 x 10 _5 /i 3 Mpc -3 , 
whereas LRGfaint is a luminosity-bin sample with a num- 
ber density of n — 8 x I0~ 5 h 3 Mpc -3 . As a result, 
different strategics must be applied when modelling the 
sub-samples, as discussed in the previous two subsec- 
tions. The LRG sample under consideration spans a red- 
shift range 0.15 < z < 0.35 with an effective redshift of 
z c s = 0.24. This effective redshift was derived from the 
lensing analysis, since higher redshift lens galaxies are 
downweighted by the lower number of source galaxies be- 
hind them. As shown in f59l Itiil ] . the clustering amplitude 
of LRGs is independent of redshift due to a subtle bal- 
ance between the redshift evolutions of bias and growth. 
Therefore we use the simulation outputs at z s - lm = 0.23, 
very close to the effective lensing redshift, for our numer- 
ical analysis. 



E. Fit results 

In Table mi we quote the inferred HOD parameters for 
the bright and faint samples when using the fiducial cos- 
mological model. We decided to use the full covariancc 
matrix for the fitting since there are non-negligible corre- 
lations between i?-bins, both in the lensing and cluster- 
ing measurements. The noise in the covariance matrix 
increases the inferred , an effect that has previously 
been investigated by [65j and we think that theoretical 
covariance predictions could improve the analysis. Im- 
perfect modelling might of course also arise from the fact 
that our simulation cosmology is not a perfect representa- 
tion of the real Universe. Due to the computational costs 
per model and the high dimensionality of the parameter 
space we have to restrict to coarse sampling of parameter 
space. It is however not our goal to precisely constrain 
the HOD parameters, but rather to obtain reasonable 
galaxy catalogues for the two LRG samples under consid- 
eration and use them to test the T statistics. We compare 
to the HOD parameters obtained by (29|, who used an 
equivalent model, finding a reasonable agreement, once 



we account for the fact that they model the full LRG 
sample. 

The galaxy catalogues have relatively low satellite frac- 
tions of ~ 4.5% for the threshold sample and ~ 10% 
for the bin sample. Fitting for the bias on linear scales 
(18 fc^Mpc < r < 90 h^Mpc) we obtain b = 2.20±0.03 
and b = 1.97 ± 0.03 for the threshold and bin sample, re- 
spectively. 





n Mi 


M cut ,i M cut ,u 


a 


a 


in 


LRGbright 


4.0 40.0 


17.8 


1.05 


1.68 


4.6 


LRGfaint 


8.0 45.0 


5.0 12.4 


0.40 


1.55 


5.7 



TABLE II: Best fit HOD parameters for the faint (f) and 
bright (b) sample: comoving number density, power law nor- 
malisation, low mass cutoff, high mass cutoff, power law ex- 
ponent, smoothing and lower satellite cutoff. The number 



densities are in units of 1 x 10 
in units of 10 13 /i _1 M B . 



h Mpc , and masses are 



VI. NUMERICAL RESULTS 

To replicate the cross-correlation coefficient between 
galaxies and matter that would be expected from real 
observational data, we use the artificial galaxy cata- 
logues described in the previous section to measure all 
of the statistics of interest. Figure [6] shows the resulting 
cross-correlation coefficients of the luminosity-threshold 
(left panel) and luminosity-bin (right panel) galaxy cat- 
alogues and two values of Rq. The ADSD is measured 
by counting the number of pairs in cylinders with length 
2Xmax = 100 /i _1 Mpc in real space. The errorbars shown 
in these figures are derived from the standard-deviation 
between the eight simulation volumes and thus represent 
the cosmic variance. We again see that AE (black), cor- 
responding to Rq = 0, leads to a cross-correlation coeffi- 
cient that is strongly scale dependent and different from 
unity, with 5-10% deviations at 10 /i -1 Mpc. However, 
if we choose Rq = 3 /i _1 Mpc, then we find that a cross- 
correlation coefficient close to unity (blue with errorbars), 
with r cc = 0.96 at 4 ft,~ 1 Mpc, as predicted by perturba- 
tion theory for biased tracers [4l|. Furthermore, we ob- 
serve this behaviour for both the luminosity-bin sample 
and the luminosity-threshold sample. This consistency 
suggests that the cross-correlation coefficient is largely 
independent of the specific choice of the HOD used to 
generate the galaxy catalogues, which is again consis- 
tent with the arguments in [4l| that the cross-correlation 
coefficient is nearly universal in the sense of being only 
weakly dependent on the halo mass. The theoretical pre- 
diction of Eq. (|3"5)) is plotted in Figure [5] as the blue solid 
line. For the latter we use £nl to predict T mm = 
and Tb = T^2. We furthermore compare to the full, 
non-expanded expression 



T 5 + aY A 



y/T^ + 2aT A + a 2 T B /2) 



(43) 
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FIG. 6: Cross-correlation coefficient of the ADSD T for the luminosity-threshold (left panel) and luminosity bin sample (right 
panel). The trivial case Ro = (black with errorbars), corresponding to bare AE, leads to a cross-correlation coefficient that is 
far from unity and furthermore strongly scale dependent. If we instead choose Ro = 3 /i -1 Mpc (blue with errorbars), inspired 
by the virial radii of the haloes under consideration, we restore a cross-correlation coefficient close to unity on the 4% level for 
all scales R > Rq. Furthermore, we can model the residual scale dependence reasonably well using the perturbation theory 
expression of Eq. (|35p (solid blue line), whereas the the bare AE deviates from the corresponding perturbation theory result 
(black dash-dotted). Since the Taylor expansion is no longer justified for scales below R = 8 /i _1 Mpc, we also plot the full 
expression according to Equation Q43p (red dashed) for T. 




FIG. 7: Reconstructed matter ADSD for the luminosity-threshold (left panel) and luminosity-bin sample (right panel). We plot 
the inferred value from the simulations including corrections for r cc 7^ 1 as well as the non-linear prediction derived from the 
measured matter-correlation of the simulations (red solid line) and the linear theory value (black dashed line). The non-linear 
correlation is well reproduced by the reconstruction, whereas there are remarkable deviations from linear theory on small scales. 

shown as the red-dash dotted line, whose range of valid- cause the difference is ~ 2% on the smallest scales con- 
ity is bounded by the breakdown of perturbation theory sidercd. 
rather than the relative magnitude of the perturbation 
terms. Given the statistical uncertainties of the direct 
simulation measurements, both expressions are viable bc- 

Thc results discussed above suggest, that we may in- 



13 



vert Eq. (fT3)) through the following, 



VII. SOURCES OF ERRORS 



T" mm (R) 



T gg (i?)r c 2 c 



oc fl 2 crl 



(44) 



The resulting statistic depends on the matter correla- 
tion function and squared matter density, which is the 
usual parameter dependence of weak lensing measure- 
ments. This dependence enables us to constrain cosmo- 
logical parameters. Our theoretical model provides us 

(T) 

with the scale dependent correction factor r^c ■ Note 
that the cross-correlation coefficient is very close to unity 
on all scales shown and even using r cc = 1 is accept- 
able given current observational constraints. However, 
future observations will measure galaxy-galaxy lensing 
with much higher statistical precision. The extraction of 
the full amount of information contained in these mea- 
surements will require an accurate modelling of r cc . As 
argued in j4lj this can be done in a relatively robust and 
model independent way. 

In Figure we show the results of such a reconstruc- 
tion based on eight galaxy catalogues with the corre- 
sponding cosmic variance errors. This reconstruction 
includes the correction for the deviations of the cross- 
correlation coefficient from unity. We see that the non- 
linear Tmm is reproduced with high accuracy, whereas 
the linear theory prediction Tmm deviates from our 
simulation result. This finding is expected, because in 
Eq. dnj} we subtract AE(i? ) at R = 3 /i _1 Mpc, which 
is already at a non-linear scale. As we go to larger scales 
this contribution is suppressed by R\jR 2 and we slowly 
approach the linear theory predictions. 

Our numerical study implicitly uses the distant ob- 
server approximation, since we project the density field 
in the simulation along one of the three Cartesian co- 
ordinate axes. In a real observation, the lines of sight 
to two nearby galaxies or to a foreground lens and a 
background source galaxy are inclined. The question 
of whether the two statistics agree is related to the ex- 
tent to which the angular and the 2D projected power 
spectra agree. As discussed in (6(|, the angular power 
spectrum corresponds to the 2D power spectrum if the 
Limber approximation [(37| is valid. The LRG sam- 
ple under consideration in our study has a median red- 
shift of z = 0.23 corresponding to a comoving distance 
Xi = 650 h M.pc. Together with the maximum pro- 
jection length Xmax = ±100 fa Mpc and the maximum 
transverse distance to the galaxy i? max = 70 hr 1 Mpc, 
this corresponds to a maximum angle of # max = 7.3°. 
The Limber approximation is typically precise to < 1% 
for I ss n/9 > 10 (68|, corresponding to 6 < 18°, and thus 
we can safely use the Cartesian analysis as an approxi- 
mation for the observations. 



Accurate studies of cosmological parameters require a 
careful consideration of all effects that might change the 
signal. In this section, we explore how large-scale red- 
shift space distortions and the difference between lensing 
and galaxy clustering window functions impacts the re- 
construction of the matter clustering. Finally, we will 
discuss how strongly the radial bins are correlated. 



A. Influence of Redshift Space Distortions 



In large redshift surveys, such as the SDSS 61| or 2dF 
[(3!|, the radial distance to a galaxy is inferred from the 
recession velocity, under the assumption of a perfect Hub- 
ble law. In reality, the coherent motions of galaxies and 
their virial motions inside haloes will add to the red- 
shift and thus distort the inferred distance. In the linear 
regime, on large scales these redshift space distortions 
can be quantified using linear theory, neglecting virial 
motions within the bound structures (for a review see 
[70l|). Following [7lj we can write the galaxy power spec- 
trum in redshift space in the plane parallel projection 
as 



p s {k) = p T {k) [i + /3fi 2 y 



(45) 



where P r (k) is the real space power spectrum of the 
tracer, [i = k ■ x/k is the position angle with respect 
to the redshift axis x and (3 = f(a)/bi(a), where /(a) = 
dlnD/dlna is the logarithmic growth rate of fluctua- 
tions. We obtain this directly by numerically evaluating 
the exact expression: 



/(a) 



1 



d In H(a) a 
dlna + (aH(a)) 3 J Q a da'(a' H(a'))- 3 



(46) 



In what follows it will be convenient to rewrite Eq. (|4"5)) 
in terms of the Legendre polynomials Li(/j,) 

P s (k) = P T (k) [a L (n) + a 2 L 2 (ti) + a 4 L 4 (/i)] , (47) 

where the coefficients are given by: 

(48) 
(49) 
(50) 
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The redshift space correlation function is then obtained 
by a Fourier transform of the power spectrum: 



£gg,s(7,^) 



V 



(2tt) 3 



dkk 2 P Y (k) I d(i[l + /3/j, 21 



r-2-K 

I dtpexp [ik ■ x] 
Jo 



(51) 
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FIG. 8: Left panel: Residual effect of redshift space distortions on the projected galaxy-galaxy auto-correlation function w m . 
We show the simulation measurements for the bright sample as crosses with errorbars for Xmax = 50 h~ Mpc (upper blue) and 
Xmax = 100 h~ 1 Mpc (lower red) and the corresponding linear theory predictions. Right panel: Residual effect on the annular 
differential surface density T gg (R; Ro = 3 /i~ 1 Mpc) for the same integration lengths. As on the left panel, the upper blue curve 
and points correspond to Xmax = 50 /i~ 1 Mpc whereas the lower red curve and points correspond to Xmax = 100 /i _1 Mpc. 
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where v is the angle between r and the axis along which 
the redshift space distortion is present, i.e. v = x ■ r = 
x/r. The correlation function multipoles in the above 
equation are defined as 



V f 00 

f«(r) = (-1)'^ J dkk 2 P t {k) ]2l {kr) , 



(53) 



where the ji are the spherical Bessel functions: ji(x) = 
Jt+i/2( x )/V^x- We note that the above formulae are 
equivalent to the formulation of [72|. 

For our investigations, we are mainly concerned with 
the projected correlation function or the closely related 
excess surface mass density. The common assumption is 
that the integration along the line of sight removes red- 
shift space distortions. This assumption, however, would 
only be correct in the limit of an infinite radial projec- 
tion window, which is not used in practice. Integrating 
Eq. (|52")) along the line of sight, we obtain 



w gg:S (i?) = / £gg, s (r,i/)dx 

"Xmax 

2 



C Xmax 

i=o ^° 



x/x 2 + R 2 



(54) 



We shall use the above result to calculate the linear the- 
ory predictions for the projected correlation functions 
in redshift space. In Figure [HI we plot the ratio of 
the real to redshift space projected correlation functions 
for the bright LRG galaxy sample with b = 2.2. We 
clearly see that the commonly used integration length of 
Xmax = 50 /i - 1 Mpc leads to residual distortions of about 
40% on scales R « 50 h~ 1 Mpc. These residual redshift 
space effects on the projected correlation function were 
previously discussed by [73|,[7|| (see also [75j]). Moreover, 
we see that on these scales the linear theory prediction 
is a very good description to the effects that we observe 
in our simulations. The difference between linear the- 
ory and simulation, on small scales, arises from the fact 
that we do not model the virial motions, which cause 
the fingers-of-god. Furthermore, the non-linear correla- 
tion function is more cuspy than the linear correlation 
function on small scales. Therefore the linear predictions 
in redshift space are boosted in amplitude by the com- 
pression along the line of sight. The non-linear projected 
correlation function is however much more influenced by 
the increased small-scale clustering, and thus at small 
separations transverse to the line of sight, it is less sen- 
sitive to the compression. 

Even though the linear prediction is a good description 
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FIG. 9: Correction factors that must be applied to the galaxy clustering measurements in order to remove redshift space 
distortions and to construct a quantity equivalent to the lensing signal. The total correction (red solid) is a product of a 
window correction (blue dashed) and a redshift correction (green dash-dotted). Left panel: Projected galaxy-galaxy auto- 
correlation function w gg . The size of the corrections is quite remarkable on the largest scales. The apparent effects on scales 
below 10 /i -1 Mpc arise from the flattening of the linear power spectrum on these scales. Furthermore the correlation on these 
scales would be affected by the finger-of-god effects not included in our analysis. Right panel: Annular differential surface 
density for Rq = 3 /i -1 Mpc. The residual correction is much smaller, 3% on the typical scales probed by galaxy-galaxy lensing. 



of the effect, removing it requires knowledge of the red- 
shift space distortion parameter /3, which requires knowl- 
edge of both the cosmological model and bias. Since these 
are not known a priory but instead they must be deter- 
mined from the data. To do this accurately an iterative 
approach is needed, which complicates the analysis and 
ultimately limits the precision. Thus, it is advantageous 
if these corrections can be made as small as possible from 
the onset. As we show in the right panel of Fig. for 
the ADSD statistic T much smaller residual corrections 
are required. The reduction is dramatic, with an order 
of magnitude smaller effect at the same scale and for 
the same radial window. As discussed above, this reduc- 
tion results from the compensated nature of these statis- 
tics, which makes them much less sensitive to the long 
wavelength fluctuations, so that the limit x max oo 
is approached faster. This makes these statistics more 
attractive for practical applications than the projected 
correlation function w. In the context of galaxy cluster- 
ing, similar compensated statistics have been proposed 
in [7a]. Note that for lensing the typical radial window 
is hundreds of h~ 1 Mpc wide, and for AE gm or T gm the 
effects of redshift space distortions can be completely ne- 
glected on scales below R « 100 /i _1 Mpc. 

The analytical predictions for the impact of redshift 
space distortions on T and w are based on the Kaiser 
model and thus make use of the flat sky and distant ob- 
server approximation. However, as we showed above, the 



ADSD is very robust to redshift space distortions with 
residual corrections on the 0(1%) level. Due to the small- 
ness of the correction and since our study is restricted to 
transverse separations that are small compared to the 
line of sight distance to the galaxies, our treatment is 
justified. 



B. Dependence on projection length 

Our final goal is to compare a galaxy-matter cross- 
correlation corresponding to a very broad lensing win- 
dow and a galaxy-galaxy auto-correlation that is calcu- 
lated from a narrow top-hat window and thus, in contrast 
to the lensing, subject to redshift space distortions. In 
this context it is necessary to devise a correction that 
accounts for both the redshift space distortions and the 
different window functions. We already discussed the 
redshift space effects and saw that they can be described 
by a scale dependent factor Ygg,r /Ygg 1 ," given by lin- 
ear theory. Here we use the superscript "clust" to de- 
note that this statistic is measured with the top-hat win- 
dow. A similar numerical factor can be used to transfer 
from the clustering to the lensing window T g g" s '/T g g,r St \ 
The corrected galaxy correlation function corresponding 
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FIG. 10: Columns extracted from the correlation matrix of the bright LRG sample Left panel: Corr(u>) Central panel: Corr(AE) 
Right panel: Corr(T), where the vertical dashed line shows the cutoff radius Rq. Comparing the correlation matrices for w 
and AS it is clear that the width of the off-diagonal contributions is larger for the projected correlation function than for the 
excess surface mass density. The same remains true if one compares the correlation of the ADSD and the projected correlation 
function. 



to the lensing measurements then reads as: 



Covariance matrix 



'Y(lcns) 

gg.r 



Y<clust) ± Si 
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(clust) 



1 gg,s 
^(clust) 
y (clust) 1 gg.r 
gg.s ^(clust) 
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(55) 
(56) 



rcdshift integration length 



where Tgg's St ' ) is the statistic that is measured in the clus- 

, j ^(lens) i j i ~v~(lcns) 

tcrmg survey and I ggi r can be compared to I gm ,r mea- 
sured from lensing. 

Figure |9] shows the correction terms for the bright LRG 
(b = 2.2) sample both for the projected correlation func- 
tion w and the ADSD T. The integration length correc- 
tion is shown as a blue dashed line. The window cor- 
rection was obtained from comparing the linear theory 

predictions for T g g" s ' ) and T g g 1 " st ' ) . Again we see that 
the resulting corrections are much smaller for T than for 
w. This occurs for the same reason as discussed above 
in the context of rcdshift space distortions: by using a 
compensated window the sensitivity to long wavelength 
modes is removed and the limit Xmax — > oo is approached 
faster, at which point the differences between different 
radial integration lengths disappear. 

Figure [9] also shows the rcdshift factor as a green dash- 
dotted line and the final correction as a solid red line. 
The redshift correction is the inverse of the curve plotted 
in Fig. [SI We see that the projection length and redshift 
space effects go in the opposite direction. This partial 
cancellation further minimises their effect, so that for the 
ADSD T, their combined effect is less than 3% even at 
R w 50 h^Mpc. 



Another benefit of the compensated ADSD is that 
its correlation matrix Corr^ = (T^Tj) / yj (TjTj) (TjTj) 
has weaker off-diagonal contributions than that for the 
projected correlation function. Usually two point statis- 
tics such as the correlation function show strong corre- 
lations between different radial bins, i. e. important off- 
diagonal entries in the covariance matrix. The compen- 
sated window Eq. (|17p relating the ADSD to the corre- 
lation function reduces these off-diagonal contributions 
remarkably. This statement refers to the cosmic variance 
contribution to the covariance matrix only. The shape 
noise adds predominantly to the diagonal error and thus 
further reduces the off-diagonals of the correlation ma- 
trix. 

In Figure [TU] we show columns extracted from the cor- 
relation matrices of w, AS and T, respectively, for the 
bright LRG sample. The covariance matrix is estimated 
by calculating the variance over 160 subvolumes of 750 x 
750 x 300 h~ 3 Mpc 3 . From this plot it is obvious that the 
off-diagonal contributions to the covariance matrix are 
reduced as one transitions from the projected correlation 
to the excess surface mass density and ADSD. One would 
expect some additional covariance due to the subtraction 
of AE(i?o) in T, but it turns out that the reduced off- 
diagonal covariance remains for the ADSD. We compare 
the signal-to-noise (S/AOae = £V . AX^C^AE,- and 
(S/N)y = V ( ^ T,C ' j ; : T ,. where C are the covariance 
matrices of AE and T, respectively. The sum runs over 
radial bins with Ri > 1 /i _1 Mpc for AE and Ri > Ro for 
T. We see that the signal-to- noise ratio is degraded by 
a factor 0.38 < {S/N) r /(S/N) AS < 0.45 over a range 
of cutoff radii 5 /i -1 Mpc > R > 1 h~ 1 Mpc. So the ad- 
vantage of being able to interpret the result in terms of 
perturbation theory just has to be paid by a factor 2 — 3 
decrease in signal-to-noise. 
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VIII. VARIANT COSMOLOGIES 

In the previous sections, we presented results for our 
mass clustering reconstruction for one specific cosmolog- 
ical model. In this section, we explore how well the T mm 
reconstruction performs for four variations to our fiducial 
model. 

The four variations are presented in Table HI and we 
denote them by C1-C4. Each of these models differs from 
the fiducial model in exactly one parameter, and for each 
variation we have performed four simulations providing 
a volume of V = 13.5 h~ 3 Gpc 3 . We populate these 
simulations using the same HOD parameters inferred for 
the luminosity-threshold LRG sample as described in ffV] 

Figure [TT] shows the cross-correlation coefficient in- 
ferred from the statistic T, for Rq = 5 h~ 1 Mpc for the full 
LRG sample. The reason for increasing the cutoff radius 
is that the cluster masses for some of the variant cosmolo- 
gies are increased and thus we apply this more conserva- 
tive cutoff radius to ensure r cc « 1 . We then see that the 
variation of cosmology does not significantly change the 
general trends observed for our fiducial model. We notice 
that there is a weak scale dependence, and in all cases 
the trend to lower r cc is well described by the theoretical 
model given by Eq. (J43J , which is over-plotted as a green 
line. There appears to be some small (~ 5%) discrepancy 
for the higher as model at smaller scales R < 10 ft,~ 1 Mpc, 
where we see an increase in the cross-correlation coeffi- 
cient. This is likely due to the fact that the cut-off scale 
Ra , is actually less than twice the virial radius of the most 
massive haloes, and so the statistics are still sensitive to 
the internal structure of the haloes. 

We examined whether the agreement might be further 
improved through using only the central LRG galaxies in 
the reconstruction. The results of this test are presented 
in Figure [T2l and we indeed find better agreement. We 
believe that this is due to the fact that the influence 
of satellite-satellite pairs from massive clusters has been 
removed. This means that the central galaxy sample is 
closer to a mass-selected halo sample and is thus less 
influenced by the details of how galaxies populate haloes. 

A simple way to further reduce this sensitivity is to 
eliminate the most massive haloes from the data. Since 
these contain many galaxies, they are easy to identify 
in an observation. In Figure [131 we plot the cross- 
correlation coefficient of a halo sample, from which we 
removed all the clusters with mass exceeding M > 3 x 
10 14 Ii~ 1 Mq and all the central galaxies. Having re- 
moved the clusters, we can lower the cutoff radius to 
Rq = 3 /i _1 Mpc. Clearly the cross-correlation coeffi- 
cient shows stronger deviations from unity. These are, 
however, better reproduced by our model than the full 
or central sample. To achieve this agreement we needed 
to change the bias ratio a accounting for the new up- 
per mass threshold. The corrections in Figure [T3] use 
a = 0.41 instead of our fiducial choice of a = 0.26. 

Another way to improve agreement between theory 
and simulation is to use a phenomcnological correction 



factor. Inspired by the fact that the correction for the 
cross correlation coefficient r^c in Equation (|3"Tj) is pro- 
portional to the correlation function, we can simplify 

our correction factor using the approximation r cc (R) ~ 
1 — a 2 £(i?/2)/4. The argument R/2 in the correlation 
function can be motivated considering the window for 
the correlation function plotted in the right panel of Fig- 
ure [2] There we see that the windows for £(i?/2) and 
T(i?) peak at approximately the same scale in k-space. 
We over-plot the phenomenological correction as the red 
dashed line in Figure [T3] With this replacement we can 
slightly improve the agreement between theory and mea- 
surement. 

In Figure [TT] we reconstruct the matter correlation 
T mm (-R) from the simulation measurements of T gm (i?) 
and T gg (i?) as the points with errorbars. For this plot we 
use the full galaxy samples, whose cross-correlation coef- 
ficient was shown in FigurcQTJ We see that the non-linear 
matter correlation function is reproduced for all the four 
variant cosmologies. Furthermore, there are clear differ- 
ences both in shape and amplitude between the different 
cosmologies, so that inference of cosmological parame- 
ters should be feasible. Differences in f2 m and eg are 
more prominent than the effect of changing the slope of 
the primordial spectrum n s . The small discrepancy be- 
tween the simulations and the theoretical prediction for 
the high as model in Fig. [TT] translates into a tension 
between inferred and real matter ADSD. 

The lower panel of Figure [14] emphasises the possibil- 
ity of inferring cosmological parameters by showing the 
fractional differences in the recovered ADSD for the dif- 
ferent cosmologies CI - C4 with respect to the fiducial 
model. Variations in the slope of the power spectrum 
differ from the fiducial model only on the 5% level at 
R = 30 /i _1 Mpc, whereas the quadratic dependence of 
the estimator on og^m leads to a clear separation of the 
variant erg and f2 m models from the fiducial model (25% 
at R = 30 h~ 1 Mpc). If the lensing study extends to suf- 
ficiently large scales as and fi m are separable by their 
shape. Here we are using the fact that a change in O m 
or erg affects the amplitude as well as the shape of the 
correlation function. 

One caveat is that the inference of T mm requires the 
assumption of an a priori cosmology. This assumption 
enters the reconstruction in three places. Firstly, we are 
using the clustering and lensing measurements as a func- 
tion of the distance transverse to the line of sight. The 
observation, however, provides both clustering and tan- 
gential shear distortions as a function of angular separa- 
tion. To relate the two, one needs to calculate the an- 
gular diameter distance to the foreground galaxy sample, 
which depends on f2 m . A wrong prior on the cosmological 
model would thus cause a horizontal shift in the inferred 
statistic. Secondly, the definition of the excess surface 
mass density includes the critical surface mass density, a 
ratio of the angular diameter distances to the lens, the 
source and between the two. The latter affects the ampli- 
tude of T mm in quadrature. Thirdly, we use cosmology to 
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FIG. 11: Cross-correlation coefficient of the ADSD T for the full galaxy sample. The panels show simulation output as circles 
with errorbars and theoretical predictions of Eq. (|35|) for Ro = 5 h~ 1 Mpc (red solid lines) . Top left panel: Reduced spectral index 
n s = 0.95 Top right panel: Increased normalisation as = 0.9 Bottom left panel: Reduced matter density ri m = 0.2, Q,a ~ 0.8 
Bottom right panel: Increased matter density fi m = 0.3, S1a = 0.7 We see that the increased number of high mass haloes in 
the f2 m = 0.3 and erg = 0.9 models leads to a higher number of satellite galaxies and thus partially compensates the drop of 
the cross-correlation coefficient for haloes on small scales. 
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FIG. 12: Same as Fig. [TT]but for the central LRGs. The central LRGs are a cleaner representation of a halo sample and thus 
better reproduce our theoretical expectations. 
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FIG. 13: Same as Fig. 1111 but for Ro = 3 h~ Mpc and for LRG sample from which the clusters with mass exceeding 
M = 3 x 10 14 h~ 1 Mq ) and all satellites were removed. Due to the cluster subtraction the bias ratio changes to a = 0.41. In 
addition to the usual correction (red solid line) we also plot the phenomenological correction r cc ~ 1 — c* 2 £(i?/2)/4 (blue dashed 
line) The cluster subtraction also allowed us to reduce the cutoff radius to Ro = 3 /i Mpc. 



compute the cross-correlation coefficient (Eq. |35|) . This 
also only has a weak dependence on cosmology, since the 
cross-correlation coefficient is close to unity to start with. 

In order to estimate the magnitude of the first two 
effects we pose the following question: How is the in- 
ferred statistic for i7 m = 0.2 or fi m = 0.3 affected if 
we wrongly assume the fiducial cosmology, Q nl = 0.25, 
for the measurement? As a reasonable example, we take 
z\ = 0.25 and z s = 0.5, for the lens and source redshifts. 
For these cases, we obtain a 2% increase (decrease) in 
S cr ; t for fl m = 0.2 (fim = 0.3) with respect to the fidu- 
cial fl m = 0.25. These results are shown as the thin 
lines in the lower panel of Fig. [TU The shift caused by 
the cosmology dependence of the angular diameter dis- 
tance to the lens galaxy has a smaller effect and is on 
the order of ~ 1%. One route to remove part of this 
dependence from the measurement is to change the es- 
timator T mm — > T mm /E^ rit . This can be done by writ- 
ing Tgm = S„it [lt( R ) - R o/ R2 lt(Ro)] and substituting 
this expression into Eq. (|44l) 

Tmm(fl) = [j t (R)-R 2 /R 2 jt(Ro)] 2 

The benefit of this redefinition is that the quantity we 
compare to theory has one cosmology dependence less, 
and S cr it can be calculated for each tested cosmological 



model. However, the angular diameter distance still de- 
pends on fl m . We could introduce another factor that 
takes care of this dependence, but for SDSS data at low 
rcdshift the effect is small. In general one can use an it- 
erative procedure or check whether within the errors on 
fi m the effects exceed observational errors. A similar it- 
erative procedure can be used to verify the sensitivity to 
the assumed value of the cross-correlation coefficient in 
the reconstruction. 

Reconstruction Procedure 

To conclude, we summarise our procedure for inferring 
the matter clustering from lensing and clustering mea- 
surements in terms of the following five steps: 

1. Measure galaxy- galaxy lensing signal 7t for a cer- 
tain lens galaxy sample, and calculate AS gm (i?) 
from the tangential shear. This first step requires 
the assumption of an a priori cosmological model 
that has to be confirmed or refuted by the final 
result. 

2. Measure the galaxy-galaxy clustering of the lens 
galaxy sample and calculate the projected correla- 
tion function w gg (R). Integrate the result to obtain 
A£ gg (i?). 
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FIG. 14: Top panel: Reconstructed matter ADSD of the vari- 
ant cosmologies for Ro — 5 h~ Mpc. The points with error- 
bars show the simulation results for the four variant cosmolo- 
gies as measured in the simulations, whereas the solid lines 
show the corresponding non-linear matter correlation func- 
tion. From top to bottom: f2 m = 0.3 (black stars), as = 0.9 
(green squares), n s = 0.95 (red circles) and f2 m = 0.2 (blue 
diamonds). The thick black line is the non-linear matter cor- 
relation function of our fiducial model plotted here for refer- 
ence. Bottom panel: Fractional difference of the reconstructed 
matter statistics with respect to the fiducial model. From top 
to bottom we show Sl m = 0.3 (solid black), as = 0.9 (green 
dotted), n B — 0.95 (red dash dotted) and Sl m = 0.2 (blue 
dashed). For the fi m = 0.2 and f2 m = 0.3 cosmologies we also 
include the effect of wrong a priori cosmology as thin lines 
with corresponding line style (for further discussion see text). 



Estimate the typical host halo virial radius of the 
galaxy sample under consideration. Use this esti- 
mated Rq to correct for the central contributions 
in AEab(-R) by calculating T AB (R) = AS AB (i?) - 
AS AB (i? )^/i? 2 . 

Make predictions for the transfer function and re- 
sulting matter auto-correlation functions for a set 
of cosmological parameters and/or modifications of 

gravity. Use these to calculate lSim°' and find the 
best fit parameters by comparison to the empirical 
result. 



5. Iterate until convergence. 



IX. CONCLUSIONS 

In our study, we examined how well can one recon- 
struct the dark matter clustering from observations of 
galaxy clustering combined with galaxy-galaxy lensing. 
This reconstruction procedure could for instance be ap- 
plied to the SDSS galaxy survey, in particular the Lu- 
minous Red Galaxies. In a first step, we generated 
realistic LRG galaxy catalogues for both a luminosity- 
threshold and a luminosity bin sub-sample of the LRGs. 
We then used these galaxy catalogues to extract infor- 
mation about the cross-correlation coefficient between 
galaxies and matter. 

We introduced a new statistic T(R), which we termed 
the Annular Differential Surface Density (ADSD), that 
removes the influence of small, non-linear scales on the 
excess surface mass density. This subtraction is neces- 
sary since the scales smaller than the virial radius of the 
haloes are dominated by the halo profile rather than the 
pre-shell-crossing evolution of the large scale cosmologi- 
cal fluid. Both numerical studies and theoretical calcu- 
lations indicate that the cross-correlation coefficient of 
the ADSD is close to unity and that the residual scale 
dependence is well described by an analytic correction. 
Having focused our investigations on the excess surface 
mass density AE gm , our results can be directly applied to 
measurements of galaxy-galaxy lensing and the projected 
galaxy correlation function. 

We also studied systematic effects that might bias the 
comparison of lensing and galaxy clustering measure- 
ments. In terms of the projected correlation function, 
both numerical studies and a linear theory treatment, 
following (7lj . show that the common integration over 
±50 /i -1 Mpc along the line of sight is still biased by red- 
shift space distortions. We have shown the necessity of 
correcting for the large scale peculiar motions in any such 
clustering measurement. We also investigated the effect 
of different window functions used for lensing and clus- 
tering. These introduce additional effects, that must be 
accounted for in the final analysis. We have found that 
the ADSD statistic T(i?) is much less sensitive to both of 
the effects, and to long wavelength modes in general, than 
the usual projected correlation function w(R), because of 
the (partially) compensated nature of its transverse win- 
dow. 

As our key result, we devised a method to recover 
the dark matter clustering from galaxy-galaxy lensing 
and galaxy clustering measurements using the cross- 
correlation coefficient for the ADSD. Assuming r cc = 1 
for simplicity leads to at most 8% bias on scales below 
R rj 5 fo _1 Mpc in the recovered statistic T mm (R). We 
can however remove this bias based on our theoretical 
modelling of the scale dependence of the cross-correlation 
coefficient. The main advantage of our method is that the 
galaxy dependence is scaled out of the equations, since 
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the theoretical model predictions for the cross-correlation 
coefficient between haloes and dark matter are relatively 
independent of the halo mass over a wide range of mass. 
Thus, we believe that the method devised here is more 
robust than the methods which are based on HOD fit- 
ting (e. g. [7^1). which fit for the cosmological parameters 
and the HOD parameters jointly, marginalising over the 
uncertainties in the HOD parameters. 

A study on four other cosmological models verified the 
robustness of the new estimator. Varying one parame- 
ter of the fiducial ACDM model at a time, we found that 
the cross-correlation coefficient shows a scale dependence 
consistent with the fiducial model. We were able to re- 
construct the ADSD of the matter correlation function, 
and the inferred statistic T mm can be used to distinguish 
cosmological models both, from the shape and the am- 
plitude of the recovered statistic. This study also showed 
that, if one is capable of accurately distinguishing cen- 
tral from satellite galaxies and/or remove clusters, then 
one can eliminate the influence of satellite galaxies and 
so render the cross-correlation coefficient closer to the- 
oretical predictions for haloes in numerical simulations. 
These advantages make it worthwhile to define a clean 
central galaxy sample and to remove the clusters. While 
the non-linear matter correlation can be recovered with 
high fidelity, the linear correlation is only recovered at 
large scales. This fact strengthens the need for a well 
developed and tested perturbation theory of large-scale 
clustering that extends into the weakly non-linear regime 
and which can thus provide us with an estimator of £nl 
without having to carry out simulations for each cosmo- 
logical model. 

The ADSD statistic subtracts out a lensing signal at 
Ro (Eq. [T^]). This subtraction procedure decreases the 
signal-to-noise on the inferred statistic T as compared 
to AE. This price seems worth paying, since it brings 
the cross-correlation coefficient much closer to unity with 
residual deviations from unity that are well understood 
theoretically. An application of this method to observa- 



tional data will have to address the problem of estimating 
AE(-Ro). The cubic spline fit used for our numerical stud- 
ies will not be appropriate given the large statistical fluc- 
tuations in observed lensing signal. Several alternatives 
to estimate A£(i?o) ar e explored in [42|], with the most 
successful being a fit with a running power-law (three 
parameters) to the radial bins around Rq. 

Our numerical results arc based on the SDSS spectro- 
scopic LRG sample, i. e. on the galaxies living in the most 
massive haloes. Based on the success and generality of 
the theoretical model we expect that a similar behaviour 
for the Main spectroscopic sample galaxies in the SDSS, 
which live predominantly in lower mass haloes. The lower 
halo masses may also enable a lower cutoff radius Rq , es- 
pecially if haloes with higher mass arc effectively removed 
from the sample. If the halo sample spans a wide range 
of masses, it should be split into mass bins and Rq should 
be chosen appropriately for each of the mass bins. We 
shall reserve this topic for future investigation. 
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